AGING IN THE RANDOM ENERGY MODEL 



O 
O 
(N 

> 

o 
O 



C/3 



I 

o 
o 



> 
m 

(N 
O 



O 



o 
o 



X 



Gerard Ben Arous 
DMA, EPFL, CH-1015 Lausanne, Switzerland 

Anton Bovier 

Weierstrass-Institut fiir Angewandte Analysis und Stochastik, Mohrenstrasse 39, D-lOlll Berlin, Germany. 

Veronique Gayrard 

DMA, EPFL, CH-1015 Lausanne, Switzerland, and CPT-CNRS, Lummy, Case907, F-13288 Marseille Cedex 9, France 

(February 1, 2008) 

In this letter we announce rigorous results on the phenomenon of aging in the Glauber dynamics 
of the random energy model and their relation to Bouchaud's 'REM-like' trap model. We show 
that, below the critical temperature, if we consider a time-scale that diverges with the system size 
in such a way that equilibrium is almost, but not quite reached on that scale, a suitably defined 
autocorrelation function has the same asymptotic behaviour than its analog in the trap model. 
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Dynamical properties of complex disordered systems 
such as spin glasses, have been at the center of interest 
among theoretical physicists working in statistical me- 
chanics. A key concept that has emerged over the last 
years is that of "aging" : it is applied to systems whose 
dynamics are dominated by slow transients towards equi- 
librium. When discussing aging dynamics, it is all im- 
portant to specify the precise time-scales considered in 
relation to the large volume limit. In "glassy dynamics" 
(see 1^ for an excellent review) one considers the infinite 
volume limit at fixed time t, and then analyses the ensu- 
ing dynamics as t tends to infinity. The typical setting 
where such a program has been carried out is Langevin 
dynamics of sperical mean field spin glasses, such as the 
p-spm SK model Q. Note that even in this setting 
multiple and even infinitely many time-scales may be ob- 
served (e.g. in the SK-model or the 2 -I- 4-spin spherical 
SK model j^). In this context, mathematically rigorous 
results have been obtained recently in |^ only for the 
p — 2 spherical SK-model. 

On a heuristic level, one expects that glassy dynamics 
describes the evolution of a system that is approaching 
"local equilibrium" , or nearest local minima of the free 
energy. However, the standard picture of the spin glass 
phase typically involves a highly complex landscape of 
the free energy function exhibiting many nested valleys 
organized according to some hierarchical tree structure 
(see e.g. P,|l7[). To such a picture corresponds the heuris- 
tic image of a stochastic dynamics that, on time-scales 
that diverge with the size of the system, can be described 
as performing a sequence of "jumps" between different 
valleys at random times those rates are governed by the 
depths of the valleys and the heights of connecting passes 
or saddle points. To capture these features which are ob- 
viously beyond the realm of glassy dynamics, Bouchaud 
and others |p],[7|, pdp^ J^ have introduced an interesting 
ansatz. Their so-called "trap models" are Markov jump 
processes on a state space that simply enumerates the 



valleys of the free energy landscape. While this picture is 
intuitively appealing, its derivation is based on knowledge 
obtained in much simpler contexts, such as diffusions in 
finite dimensional potential landscapes exhibiting a finite 
number of minima. In the systems one is interested in 
here, however, both dimension and number of minima 
are infinite or asymptotically growing to infinity. 

It is thus an important and interesting question to un- 
derstand whether, how, and in which sense the long time 
dynamics of disordered systems such as spin glasses can 
really be described by trap models, and in particular to 
elucidate the precise time-scales to which these models 
refer. To answer this question requires, of course, the 
study of the actual stochastic dynamics of the full sys- 
tem at diverging time-scales, which is, in general, a very 
hard problem. 

In this note we report on the first rigorous results link- 
ing the long-time behaviour of Glauber dynamics to a 
trap model in the context of the "simplest spin glass" |l9| ] 
model, the random energy model (REM) While 
this model is surely far from "realistic" , it offers a num- 
ber of features that are "typical" for what one expects 
in real spin glasses, and its analysis involves already a 
good number of the problems one would expect to find 
in more realistic situations. The main advantage we will 
draw from this is, of course, the fact that the equilibrium 
properties of this model are perfectly understood. 
The REM. We recall that the REM 0^ is defined 
as follows. A spin configuration cr is a vertex of the hy- 
percube Sn = We define the family of i.i.d. 

standard Gaussian random variables {E^}^^Sj^ . We de- 
fine a random (Gibbs) probability measure on Sn, Hf3,N, 
by setting 



where Zp^N is the normalizing partition function. 
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It is well-known |l4|l5[ t hat this model exhibits a phase 
transition at Pc = v2hi2. For P < Pc, the Gibbs mea- 
sure is supported, asymptotically as t oo, on the set 
of states a for which E„ ~ VN /?, and no single configu- 
ration has positive mass. For /3 > /3c, on the other hand, 
the Gibbs measure gives positive mass to the extreme el- 
ements of the order statistics of the family E^; i.e. if we 
order the spin configurations according to the magnitude 
of their energies s.t. 

£'cr(l) > Ea(^) ^ E^(3) > . . . > £^£,(2«) (1) 

then for any finite fc, the respective mass /Lt/3_7v(c''^-') will 
converge, as N tends to infinity, to some positive ran- 
dom variable Uk] m fact, the entire family of masses 
l^f3,N{<7^''^), K G IN will converge to a random process 
{j^fcjfcew, called Ruelle's point process ||2l|,[l|]. 

So far the fact that a are vertices of a hypercube has 
played no role in our considerations. It will enter only in 
the definition of the dynamics of the model. The dynam- 
ics we consider is a Glauber dynamics cr(t) on Sn with 
transition rates 

p^{a,v) = e-^^^', 

when a and r] differ by a single spin flip. Note that the 
dynamics is also random, i.e. the law of the Markov 
chain is a measure valued random variable on that 
takes values in the space of Markov measures on the path 
space . We will mostly take a quenched point of view, 
i.e. we consider the dynamics for a given fixed realization 
of the disorder. 

It is easy to see that this dynamics is reversible with 
respect to the Gibbs measure fJ.[3^N- On also sees that it 
represents a nearest neighbor random walk on the hyper- 
cube with traps of random depths. 

The REM-like trap model. The idea suggested by 
the known behavior of the equilibrium distribution is that 
this dynamics, for /3 > /3c, will spend long periods of time 
in the states . etc. and will move "quickly" 

from one of these configurations to the next. Based on 
this intuition, Bouchaud et al. proposed the "REM- 
like" trap model: consider a continuous time Markov pro- 
cess Zm whose state space is the set Sm = {1, ■ • • , M} of 
M points, representing the M "deepest" traps. Each of 
the states is assigned a positive random energy Ek which 
is taken to be exponentially distributed with rate one. If 
the process is in state fc, it waits an exponentially dis- 
tributed time with mean proportional to e^*^", and then 
jumps with equal probability in one of the other states 
fc' e Sm. 

This process can be analyzed using techniques from 
renewal theory. The point is that if one starts the pro- 
cess from the uniform distribution, it is possible to show 
that the counting process, c(t), that counts the number 
of jumps in the time interval (0,t], is a classical renewal 
(counting) process flq]; moreover, as n 1 oo, this renewal 



process converges to a renewal process with a determin- 
istic law for the renewal time with a heavy-tailed distri- 
bution whose density is proportional to where 
a = /3//3c. 

The quantity that is used to characterize the "ag- 
ing" phenomenon is the probability IIm (i, s) that dur- 
ing a time-interval [t, t -\- s] the process does not jump. 
Bouchaud and Dean showed that, for a > 1, 

hm . 1, 

MToo HQ{s/t) 

where the function Hq is defined by 

Ha(w) EE -— - / dx- —-r- (2) 

^ ^ 7rcosec(7r/Q) {l + x)x^/°' ^' 

Note that Ho{'w) behaves like 1 — Cw^^^^", for small w, 
and like Cw~^^°' for large w. 

Our purpose is to show, in a mathematically rigorous 
way, how and to what extent the REM-like trap model 
can be viewed as an approximation of what happens in 
the REM itself. To this end we first introduce the set 
Tm = {cr^^\ . . . , o-(*^)} of the first M states defined by 
the enumeration 0. Ideally, we would like to start with 
our original process a(t) and construct a new process Ym 
as follows. Let ri, T2, . . . , t„, . . . be the sequence of times 
at which a{t) visits different elements of the (random) 
set Tm- Then set 

M 

Xn,m{^) = X! ^ Icr(''-)=(T(r„)Ir„<t<T„+i 
fc=l 

i.e. X{t) takes the value k during time-intervals at which 
the process a{t) "travels" from a'^^^ to the next element 
of this set. We would hke to say that Bouchaud's process 
Zm approximates, after an N and M dependent rescaling 
of the time, this process X, if and M are large, i.e. that 
in some appropriate sense, for some function c(7V, M), 

ZM{t)^XN.M{c{N,M)t) 

when first iV, then M , and finally t tend to infinity. This 
problem involves two main assumptions: 

1) The process jumps with the uniform distribution from 
any state in Tm to any of the other states in Tm, and 

2) The process observed on the set Tm is, at least asymp- 
totically, a Markov process, in particular, the times be- 
tween visits of two different states in Tm are asymptoti- 
cally exponentially distributed. 

While it appears intuitively reasonable to accept these 
assumptions, they are a) not at all easy to justify and b) 
the second assumption is not even correct. In fact, we will 
see that such properties can be only established in a very 
weak asymptotic form, which is, however, just enough to 
imply that the predictions of Bouchaud's model apply to 
the long time asymptotics of the process. 
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We will now present our main results. The full proofs 
of these results will be given in two forthcoming papers 

ii- 

We define instead of the set Tm introduced above the 
sets, for E € IR, 



T = Tn{E) = {<je Sn\E, > un{E)} 



where 



un{x) = V2iVln2 + 



V2N In 2 
We will call this set the "top". 



1 ln(7Vln2) + ln47r 

2 V2iV In 2 
Note that TNiE) = 



Ti 



\TNiE)\- 



Our first result concerns just the "motion" of the pro- 
cess disregarding time. Let . . . , be an enumer- 
ation of the elements of T{E). Now define (for fixed 
N and E), the stochastic process with state space 
{!,..., \T{E)\} and discrete time i € IN hy 



r(N) 



(3) 



It is easy to see that is a Markov process whose 

transition probabilities p{i,j) are nothing but the prob- 
abilities that the original process a{t) starting at ^* hits 
T first in the point . To formulate our first theo- 
rem it will be convenient to fix the random size of the 
state space of this process by conditioning. Thus set 
PM{-)^P{-\\TiE)\=M). 

Theorem 1 Let cr{n) denote the Markov chain with 
transition matrix defined in (1.3) and whose initial dis- 
tribution is the uniform distribution on Sn- Let Y^'^^ 
be the Markov process defined by |^. Let Yg denote the 
Markov chain on {1, . . . , M} with transition matrix 
given by 



M- 

0, 



if i 
if i= j 



and initial distribution p\j{i) — 1/M. Then, for all AL G 
IN, 

ym^Y, PM-a.s. 



Remark: Note that the statement of the theorem im- 
plies the convergence in law (w.r.t. P) of the probability 
distribution of Y'^^\ 

The next results concern the mean times of the motion 
towards the top and between points of the top. 
Theorem 2. Assume that a = /3/V21n2 > 1. Then 
on a set of probability one for all large enough N, the 
following holds: 

(i) For all rj £ Sn, let T(r]) denote the mean time the 
process starting in rj takes to reach TN{E)\rj. Then 



M -1 



(ii) For all r],fi G T{E), the expected time, Z{rj,ff), to 
reach fj starting from rj conditioned on the event 
that f] is the first site different from rj in T]^(E) 
that is reached, is almost independent of fj, more 
precisely 

\Zirj, fj) - T{7^)\ < ^^WN.EOil/N) (4) 
where Wn,e is a random variable of mean value 
IEWn,e-- 



a-1 

and whose standard deviation is negligible compared 
to the mean, as E tends to —oo. 

(Hi) If a < 1, then, for all a G Sjsi, 

M 



T{a) 



M -1 



Remark: We see that for —E very large, Wn,e ~ 
g/3VlVjiiv(-E)) represents a natural time-scale for the pro- 
cess on T{E). Thus (i) implies that for all a ^ T{E), 
the mean time of arrival in the top is proportional to 
^I3^/Num(E)^ On the other hand, there exists 7? G T{E) 
such that the mean time of first exit from 77 which is 
^/NEjj, is just of this order. Thus the slowest times of 
exit from a state in T{E) are of the same order as the 
time it takes to reach T{E). This can be expressed by 
saying that on the average the process takes a time t to 
reach states that have an exit time t. This is a first mani- 
festation of the aging phenomenon. In contrast, if a < 1, 
for all a G T{E), IEt^^^^^, « sup^^^^ iErJ^^^, and 
thus the time spent in top states is irrelevant compared 
to the time between successive visits of such states. Thus 
we see a clear distinction between the high and the low 
temperature phase of the REM on the dynamical level. 
Notice that, as has been observed in the dynami- 
cal phase transition is not accompanied by a qualitative 
change in the spectral gap, which in all cases is related 
to the largest exit times. For related results on the high 
temperature dynamics see also [ pO[ . 
Remark: Statement iii) of Theorem 2 expresses the fact 
that the mean times of passage from a state 77 G T{E) to 
another state fj G T{E) are asymptotically independent 
of the terminal state fj. This confirms to some extent 
the heuristic picture of Bouchaud. Indeed, if in addition 
we added the hypothesis that the process observed on 
the top is Markovian, then the two preceeding theorems 
would immediately imply that the waiting times must be 
exponentially distributed with rates independent of the 
terminal state and given by ^ This, however, cannot be 
justified. The reason for the failure of the Markov proper- 
ties can be traced to the fact that the time spent outside 
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of T{E) when travelling between two states of T{E) can- 
not be neglected in comparison to the waiting time in 
the starting point, which in turn is a manifestation of 
the absence of a true separation of time-scales. 

We now turn to a more precise analysis of the aging 
phenomenon. 

The natural generalization of Bouchaud's correlation 
function IlMit, s) is the probability that the process does 
not jump from a state in the top to another state in 
the top during a time interval of the form [n,n + m]. 
There is some ambiguity as to how this should be defined 
precisely, but the most convenient definition is to define 
TlainjUi) as the probability that the process starting at 
time in (T does not jump during the time interval [n, n + 
m] from one state in T to another state in T. 

Of course we still have to specify the initial distribu- 
tion. To be as close as possible to Bouchaud, the natural 
choice is the uniform distribution on Tn(E) that we will 
denote by tt^;. The natural correlation function is then 



n(n, m) 



The following theorem establishes the connection to the 
trap model: 

Theorem 3. Let j3 > V 2 In 2 . Then there is a sequence 
cn,e exp{P^/NuN(E)) such that for any e > 



lim lim lim P 



Ho{s/t) 



1 



> e 







where Hq is defined in 

Remark: Note that the rescaling of the time by the fac- 
tor cn,e ensures that we are observing the system first on 
the proper time of equilibration as N goes to infinity, and 
that then, as E tends to minus infinity, we measure time 
on a scale at which the equilibration time diverges. Thus 
the trap model describes the large time asymptotics on 
the "last scale" before equilibrium is reached. This is to 
be contrasted to the other extreme of "glassy dynamics" , 
when the infinite volume limit is taken with a fixed time 
scale. 

The proof of this theorem is rather involved and is the 
main content of |^]. It may be instructive to see a brief 
outline of our approach. Basically the idea is to mimic 
the proof in the trap model and to set up a renewal equa- 
tion. Now it is easy to derive a renewal equation for the 
quantities Ila{n,m). However, in contrast to the situa- 
tion of the trap model, it is not possible to obtain a single 
closed equation for n(m, n). This means that we actually 
have to study a system of renewal equations which ren- 
ders the proof rather complicated. The key ingredients 
then are precise estimates of the Laplace transforms of 
the probability distributions entering the renewal equa- 
tions in the complex plane. What makes the final result 
emerge is then the fact that in the neighborhood of the 
origin (which corresponds to large times), the Laplace 



transforms have almost the desired properties that would 
lead to such a closed equation. This makes it possible to 
employ perturbation expansions to prove the theorem. 
Acknowledgements: We thank J. -Ph. Bouchaud, L. 
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